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This paper describes modeling solar power generation as a renewable energy 
generator by simulating the analytical approach mean absolute error and root 
mean square error (MAE and RMSE). This research estimates the error 
referring to long short-term memory (LSTM) network learning. Related to 
this, the Indonesian government is currently actively developing solar power 
plants without ignoring the surrounding environment. The integration of 
solar power sources without accurate power prediction can hinder the work 
of the grid and the use of new and renewable generation sources. To 
overcome this, virtual power plant modeling can be a solution to minimize 
prediction errors. This study proposes a method for on-site virtual solar 
power plant efficiency with a research approach using two models, namely 
RMSE and MAE to account for prediction uncertainty from additional 
information on power plants using virtual solar power plants. A prediction 
strategy verified against the output power of photovoltaic (PV) modules and 


a set based on data from meteorological stations used to simulate the virtual 
power plants (VPP) model. This forecast prediction refers to the LSTM 
network and provides forecast errors with other learning methods, where the 
approach simulated with 12.36% and 11.85% accuracy for MAE and RMSE, 
respectively. 
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1. INTRODUCTION 

The involvement of energy use continues to increase resulting in the availability and stability of 
world energy commodity prices, but also opportunities for cooperation as well as competition between 
countries in the Asia Pacific region. Indonesia is currently planning to continue developing renewable energy 
with a policy of around 23% of its energy being supplied by modern renewable energy by 2025, with at least 
31% coming from modern renewable energy sources by 2050. According to the international energy agency 
(IEA), energy demand continues to increase by around 30% by 2015-2040 [1]. PT PLN has discontinued 
several steam power plants until 2056, with initial participation in 2030 and concentrating on renewable 
energy [2]. 

Solar energy is an effective form of solar energy and a renewable energy solution that can reduce the 
greenhouse effect and global warming [3]. Indonesia has a renewable energy potential of 4.80 kWh/m?/day 
and only uses 10 MWp in 2024 [4], whereas Indonesia has relatively large solar energy of around 0.87 GW 
in 2025 [5]. In addition, Indonesia also produces electricity from solar energy with a potential of around 
640,000 TWh per year, thus enabling the achievement of energy mix targets that can reduce greenhouse gas 
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emissions through the development of renewable energy [6]. Market demand for solar energy has been driven 
by government incentives and tax breaks for installing solar panels and increasing environmental pollution 
[7]. The need for renewable energy in the global market continues to increase by around 25.9% until 2030 
[8]. Several factors are driving the spread of solar energy, including the decline in the price of solar panels, 
and environmental, political, social, and cultural issues. Based on research from Sovacool et al. [9] that, 
global installed capacity is affected by identified constraints including the pace of achieving solar dominance, 
lack of investment for efficiency, and regulations governing the distribution of solar power. Coordination of 
photovoltaic (PV) installations with virtual power plants (VPP) power generation models for global 
production prediction provides accurate values integrated into the network [10]. 

Strategy and power prediction at PV and prediction interval chosen for PV facilities in the VPP 
model can be replicated for real PV and can improve high accuracy with two models mean absolute error and 
root mean square error (MAE and RMSE) with values of 12.37% and 11.84% respectively [11], [12]. The 
limitations of technology in the past have an impact on the current electricity network, so it is necessary to 
fundamentally restructure fossil energy to serve the energy needs of renewable energy in the future [13]. 
Integral part and participation of energy resources distributed and decentralized small-scale power generation 
units with traditional power generation passively limited to the existing distribution network [14]. One 
possible solution is to combine renewable energy sources (RES) by coming up with the idea of a VPP, which 
is described as a collection of various DERs that function as one unit [15]. The general description of the 
VPP model has been discussed in real-life research [16]—[18]. Power system design with a high proportion of 
RES is an important part of the renewable energy system change [19], [20]. This methodology is based on 
QRF modeling to obtain optimal supply from automatic frequency recovery reserves provided by renewable 
power aggregation [21]. 

Switching to new and renewable energy can help reduce greenhouse gas emissions by limiting the 
impact of extreme weather and climate while supplying reliable, timely, and cost-effective energy. PV 
breakthroughs can be attributed to the increased integration of PV systems into VPPs, which makes this 
energy sales model difficult to implement due to regulatory constraints [22]. The VPP model is the ultimate 
solution offering for reducing the carbon footprint of conventional power grids and the integration of PV and 
wind [23]. This VPP addresses sudden climate change resulting from PV output as well as the management 
of other energy requirements under the same VPP [24]. Major divisions of uncertainty need to be managed 
by the VPP [25]. A common solution for managing PV uncertainty is to invest in energy-storage capabilities 
in batteries [26]. However, the VPP model is expected to have real-time operational capacity to match the 
pre-designed plans based on other PV forecasts [27]. An alignment is performed to get a match from the 
previous market quote to execute a second VPP order on a real market trade [28]. The VPP model deals with 
distributed data collection and management of distributed energy sources to meet certain optimizations [22]. 
VPP discovery and modeling involve complex information and computer techniques related to modeling and 
energy flows [24]. The concept and technology used for VPP are shown in Figure 1. 
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Figure 1. Concept of energy and technology used for VPP 
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Energy predictions together with load demand and energy financing, at different times and resolutions, 
are entered into the equation model. VPP model prediction methods can be differentiated according to different 
factors, for example, parameters to be forecast (radiation), time and resolution constraints, waiting times, model 
predictions, and statistical properties of forecasting [29]. The model directly predicts PV through PV datasets 
and past climate conditions. Indirect forecasting is the prediction of solar radiation and solar energy that can be 
calculated using a PV performance model [30]. Some consideration of intervals for predictions and expected 
uncertainties can provide additional information in making judgments and can provide a reasonable number of 
values and assigned probabilities of those values [31]. To solve this problem, the researcher presents an 
approach to predict PV power by using program tools for the locations studied. 

The contribution of this proposed paper is modeling solar power as a virtual generator and 
prediction using the Rayleigh model with the main contributions being; i) the PV prediction model obtained 
at VPP to minimize forecasting errors by modeling a two-parameter function; ii) the prediction interval time 
for modeling the prediction uncertainty is a function that depends on hold time and launch time with cloud 
cover factor (CCF); and iii) data input strategy for this prediction comes from data sources that have open 
access and are free in cost savings in the VPP model. 


2. METHOD 
2.1. Proposed method 

The strategy of the proposed daily power forecasting framework is shown in Figure 2 to predict the 
proposed daily power, consisting of data input and power prediction preprocessing, and the VPP model. 
Several steps for input data preprocessing as used in training and prediction models. The expected output of 
the prediction algorithm is the energy management system (EMS) input of the observed VPP. Numerical 
weather prediction (NWP) module for irradiation forecasting design and training to estimate power given 
input data from NWP model to generate VPP model after correction by EMS. This process continues until 
the expected solar power plant is produced. 
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Figure 2. Proposed forecasting framework 


2.2. Enter data 

The input data consists of several classifications that are tailored to the source and information 
required. The first input data consists of turbidity and temperature derived from prediction maps with scales 
obtained from the meteorology, climatology, and geophysics agency (MCGA). The radiation turbidity 
prediction data is intended to determine the CCF, which can indicate the cloud area on the shadow 
NWP-based turbidity map at the PV installation. This parameter determines the type: sunny, cloudy, and 
overcast. It is expected to obtain a set of data that can be assigned to different groups in terms of making 
prediction intervals. Where temperature data is used to estimate the temperature of solar panel cells during 
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prediction [31]. Climate maps based on NWP are particularly interesting because of some useful weather 
changes that may not be available at solar installations. Deviations in cell temperature forecasts were 
assessed using data obtained from MCGA. A prediction model with MAE was implemented for temperature 
corresponding to the NWP map of 2.13°C obtaining a power of 46.96 W/m?, for measurements throughout 
the year. The prediction modeling of the dataset for training purposes consists of the sun position, which is 
used in the CCF calculation to determine the daily data, while the space radiation is used as a prediction, and 
the radiation on the inclined plane of the PV module location. 


2.3. Data pre-processing 

To predict the PV power consisting of cell temperature and irradiance on the inclined plane, it is 
necessary to perform quadratic interpolation to predict the NWP. The temperature changes around the study 
can be assumed for the NWP map as proposed by [32]. The exact substantiation with the one-year 
approximation model is shown in Table 1, which illustrates that the ambient temperature is obtained from the 
predicted value measured from the MCGC station for PV module modeling. This parameter allows the daily 
radiation to influence the identification of a specific period to change the PV module in the period of the 
presence of clouds to change the PV generation in the region by preventing the solar radiation which gives an 
exact picture of the calculation of the detected parameters [33]. 


Table 1. Performance model matrix 
Metrics Scale (W/m?) Percentage (%) 


Absolute 1 £ n lor LA 
MAE = > Y,- Y, T|% =Y 
T rMAE = I x100% 
=i tyr y, 
Square Pes m2 
RMSE = T21 (¥ E r) 
rRMSE = TT *100% 


Ft Y; 


2.4. Irradiance modeling and prediction 

Prediction of solar irradiation based on Rayleigh modeling with the aim of i) obtaining the average 
power of the PV model for 30 minutes each day on PV modules and ii) looking at possible uncertainties in 
the forecast results [34]. The data information in Table 1 shows the main inputs for the EMS on the VPP 
module. The modeling and prediction results are shown in Figure 3. To minimize the error of the training 
process by calculating the RMSE and considering the mismatch of the corresponding system, but also 
considering the computation time. Prediction of effective solar radiation on the inclined plane of the PV 
module. The first is to calculate the effective irradiation using data from two irradiating elements on a plane, 
either direct or diffuse. In this case, the albedo is zero and is estimated as a function of the clarity index (kth) 
to account for the mass transfer fraction (k)dh [35]. Based on this information, the conversion to an inclined 
plane is approximated by diffuse radiation and albedo [36]: 


albedo = roghm, (1 — cosg )/2 (1) 


Where ro is the albedo coefficient, with a value of 0.2 ghmo as the global horizontal irradiance (GHI) and £ as 
the panel tilt angle. The effective radiation is determined by considering the angular loss [37] and spectral 
[38] with the p-Si module and medium dust level at DT 0.97 for installation. Model for converting PV power 
from its effective solar radiation [39]. 


Poc = SFnpcPpeak cea (1 + OP (Tee — Tecustc)) (2) 
Where Ppc is the PV estimate, SF serves as a representation of the shadow loss caused by the environment, 
for this particular case, ypc=0.928 including cable losses, with module tolerance and mismatch losses; 
Ppeak=2.98 kW as the peak power of the installation, Gpanei serves as the useful solar radiation of the 
calculated panel, Gsrc is 1 kW/m? which is the solar radiation under standardized testing conditions with 
(STC), Pm is 0.4%/°C which is the temperature change of the PV panel installation, Teen as a function of cell 
temperature, and Teen, src is the cell temperature under STC. The determination of cell temperature is 
expressed by (3), ignoring wind speed, which is a complex effect that is insignificant to the model and does 
not affect the panel facility uniformly [40]: 
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Where Teen, nocr=45°C represents the cell temperature as a function of normal operating cell temperature 
(NOCT); Tamb, vocr=20°C is the ambient temperature under NOCT conditions; Gyocr is 800 W/m? which is the 
solar radiation under NOCT conditions; and Tamy is the ambient temperature resulting from NWP predictions. 
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Figure 3. Illustration of solar radiation at NOCT 


After all the datasets related to PV prediction, it can calculate the prediction interval to predict the 
new one. A prediction interval is a prediction interval with a random variable to judge in the future whose 
magnitude is unknown [41]. In this paper, tabulated prediction intervals are used based on research that has 
been done considering the Laplacian distribution model for errors as a function of lead time, launch time, and 
day type [42]. 

Temperature changes between the module and the air caused by solar radiation can result in heat 
transfer losses and faster heat transfer that occurs in solar insulation damage for a given wind speed with 
thermal resistance and heat transfer coefficients that do not vary much with temperature changes. In Figure 3, it 
can be seen that the change in NOCT for some PV modules in the best case, worst case, and average in the best 
case resembles aluminum fins on the back of the module for cooling which lowers the thermal resistance and 
increases the surface area for heat transfer in other media. Figure 4 shows the time interval for 90% daily which 
has provided valuable additional information from this prediction. PV generation is highly dependent on 
weather conditions and seasonal changes. This can affect the ability of the prediction algorithm to calculate a 
precise prediction and can provide some level of uncertainty that should be evaluated. Prediction levels are used 
to express the level of uncertainty at the point of forecasting which adds to a given confidence level. 
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Figure 4. Interval prediction for PV 
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3. RESULTS AND DISCUSSION 

The research results for this study, obtained from the proposed daily forecasting strategy for VPP 
generation modules can be differentiated in several ways, namely: i) PV power estimation from the 
forecasting output; ii) prediction interval quantitative assessment; and iii) VPP module schedule. First, the 
research results are validated against PV module installations acting as VPP nodes. Secondly, a strategy is 
developed for a replicated VPP model using data-driven meteorological stations. The proposed model is 
evaluated by its effectiveness and compared to its performance to determine its accuracy compared to 
methods proposed in other literature. 


3.1. Predictions for virtual power plants nodes 

Forecasting for real VPP nodes is performed using irradiance measurements taken at existing PV 
facilities at MCGA. GHI forecasting can provide a launch and wait function with a specified time and 
parameters in the future to calculate the prediction interval. The associated error rating will depend on the 
following two model metrics: i) the dependence of the metrics on the MAE and RMSE scales and ii) the 
percentage error of the rMAE and rRMSE metrics. The metrics give an idea of the absolute value of the 
average forecasting information, but the squared value is more sensitive to data deviating from the average 
data and it is the analysis of both that makes it possible to study the overall prediction results. 

A magnitude of the error ratio value that also gives an understanding of what is being done to the 
error is a fair comparison for dependability, but as it approaches zero, a scale-dependent metric is the 
preferred choice. Table 1 is a performance matrix for the value of Y, which is the measured data from time t, 
Y, is the forecast value for time £, and T is the length of consecutive times that can be used to provide a value 
for the accuracy of the algorithm. 

The values Yy, and Y are predictions of Y, at fo and fo is the predetermined start time for each day 
equivalent to sunrise. With the error assessment of the two parameters, it is necessary to determine the 
amount of lead time and launch time. The exact waiting time at (t’-f) gives the difference between the 
specified waiting time and the predicted launch time. The launch time is indicated by (t'-to) and is the 
difference between the current time and sunrise. The launch time and waiting time for a specific day 
prediction are described in Figure 5, where the launch time is set and the waiting time is used as a parameter, 
The prediction vector is obtained when the two parameters are set to specific values. 
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Figure 5. Measurement and prediction results 


Figure 5 illustrates the errors in lead time and launch time leading to the following conclusions. 
First, for the scaling error, a high error rate is observed for short launch times under a moderate timeout. It is 
expected that the scaling error will be mostly under the former setting with high solar radiation, but as the 
launch time increases, this error is significantly reduced. Secondly, the lower the solar radiation, the smaller 
the error scale, and for the error percentage it will be the opposite; when the launch time is small (smaller 
than 1 hour), the high error percentage will be independent of the time limit [43]. Finally, the forecasting 
interval derived from the MAE, possibly a distribution given by dividing the prediction as a function of lead 
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time, launch time, and type of day would be particularly useful at the high level of accuracy required for the 
prediction. The predictions obtained with the forecasting error in this study are much smaller at short lead 
times. The comparison strategy described in this paper regarding estimation can be concluded, that small lead 
times will result in better estimates compared to traditional methods [44]. 


3.2. Photovoltaic prediction from global horizontal irradiance forecast 

To estimate the PV model from the GHI prediction there are several steps, namely i) instantaneous 
time prediction; ii) surveying the location from altitude, latitude, and longitude; iii) installation performance 
covered in the orientation and inclination of the panels, the PV model identifying the parameters available in 
the data sheet and the losses associated with each part of the installation; and iv) from the NWP map on the 
ambient temperature. For the above purposes, it is necessary to perform an analysis technique that allows us 
to quantify the errors made in solving the problem. The results of two different approaches consist of: first, 
making a comparison between the actual PV measurements at the site and the estimated PV values from the 
GHI measurements. Second, the PV module is predicted from the GHI estimated values and evaluates the 
errors associated with all these processes. The main GHI looks for the error value obtained to maintain the 
same performance as obtained in the previous section by using the error to build the prediction interval. 
Figure 4 is a comparison between the measured value of the PV module at the site and the estimated PV 
obtained from the GHI measurement at the site. Choose from three types of days namely; cloudy days, 
cloudy days, and sunny days [45]. The x-axis is the solar time function and the y-axis is the GHI to be 
sought. For the experimental setup at the studied site which has buildings near the PV panels that provide 
partial shading until sunset. Figure 4 is a comparison between the measured value of the PV module at the 
site and the estimated PV obtained from the GHI measurement at the site. Choose from three types of days 
namely; cloudy days, cloudy days, and sunny days. The x-axis is the solar time function and the y-axis is the 
GHI to be sought. For the experimental setup at the site under study which has buildings near the PV panels 
that provide partial shade until sunset. Figure 5 is a comparison between the measured value of the PV 
module at the site and the estimated PV obtained from the GHI measurement at the site. Choose from three 
types of days namely; cloudy days, cloudy days, and sunny days. The x-axis is the solar time function and the 
y-axis is the GHI to be sought. For the experiment setup at the site under study which has buildings near the 
PV panels that provide partial shade until sunset. 

The modeling of this condition is expressed in (2), assuming that the change is linear with time as 
shown in Figure 5 with SF 0.95 at 16:36, decreasing to SF 0.4 at sunset, and also varying depending on the 
season of the year [46] obtained an rMAE value of 2.55% for sunny days, a rMAE=3.05% for partly cloudy 
days, and an increased rMAE value of 4.04% for cloudy days. Observations based on the square error, where 
the rRMSE value is 3.44% on sunny days and the rRMSE is 3.89% on partly cloudy days, up to an rRMSE of 
5.96% for cloudy days. The transition characteristics of the MPPT inverter control in the presence of passing 
clouds cause the inverter operational point to become unstable. This increases the daily error but does not 
cause problems in the forecasting process with a time difference of about 15 minutes which can reduce the 
negative effect. Figure 5 illustrates the estimation error resulting from the difference between measured and 
estimated PV modules as a function of lead time and launch time so it has a similar picture to the previous 
figure, with almost the same percentage error. It can be concluded that it is almost the same as that achieved 
in Figure 5 so it can be stated that: 1) short launch times and moderate lead times result in high scaling, but 
decrease significantly with increasing launch time; ii) launch times of less than one hour result in high error 
rates that appear to be independent of lead time; and iii) high error rates at lead times above 7 hours. 


3.3. Estimated photovoltaic power interval 

Additional information for prediction intervals over a reasonable range of PV power generated at a 
given location and confidence level is selected by the user. Taking a prediction interval can increase the 
uncertainty in the point estimation and can avoid unexpected energy shortage or, conversely, energy 
overload, which is less important than the former because the inverter can change its operating point to 
produce the required energy, although wasting the required energy resources can be overused. In this paper, 
the prediction intervals are obtained based on the operations performed by [47]. The results depend on the 
accuracy of the estimation of the delay time and launch time. The factual results used aim to divide the 
forecast data set and create groups, assuming that a certain distribution is built on the MAE. This group 
division is determined when choosing the launch time and lead time where approximately 365 samples per 
batch are obtained in a full year. Figure 5 shows different error distributions for launch time values of 2, 4, 
and 6 hours, with lead time values of 1, 2, and 3 hours. In the assumed Laplacian distribution, this is similar 
to the work done by the CCF function. The prediction interval for each subset can be determined by the MAE 
assuming that it is for the Laplacian distribution [48]. A more detailed subdivision can be determined with 
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the groups selected as a function of the CCF but must take into account the number of groups as presented 
earlier with insufficient samples from each group to create a proper error distribution [48]. 

Mitigate this weakness by reducing the number of CCF groups to three and using the type of day 
classification described above. The study size with CCF has an hourly decision with a value of zero when the 
sun is not obscured by clouds and one when the sun is completely covered by clouds. The k-nearest neighbor 
(k-NN) method is usually used to form clusters that allow the dataset to be divided simply and offer an 
independent solution for each object in the VPP. Assuming that for the Laplacian distribution, each new 
subset selected carries an error that needs to be measured. Coverage probability interval prediction [49] 
indicates the share of predicted values that fall within the selected interval and are close to the confidence 
level. The confidence level chosen in this study is 80%, although this number can be modified depending on 
the operational risk to be addressed by high-risk objects and thus the higher the usefulness of the tool. 


3.4. Discussion 

The technical development of VPP should be supported by current EMS technologies with PV 
modules expected to be a very important part. The energy generated by each VPP node referring to 
renewable sources allows for optimization of the expected benefits of energy exchange with the grid 
operator. It is difficult to predict PV modules when it is necessary to collect information from several nodes 
spread over a large area especially when the input data required to predict is expensive. This research 
proposes a way to achieve this goal by using a strategy based on LSTM-RNN by forecasting the GHI using a 
dataset of solar radiation values obtained from direct satellite data and then the solar energy utilized by PV 
installations [50], [51]. First of all, it provides results for the GHI estimation for installation based on hold 
time and launch time, which allows regions with lower errors and high confidence levels to be generated in 
the form of day-type-dependent prediction intervals. The GHI error is a function of the hold time and launch 
time, which shows low behavior when the launch time is lower than 1.5 hours, based on sunrise. To avoid 
this, the forecasting process can start 1.5 hours after sunrise. This proposed study can rely on a few days 
ahead prediction made to obtain solar radiation forecasts [52]. An assessment of the accuracy and rigor of the 
safety forecasts and the results were compared with the literature, with results similar to those obtained from 
deep learning algorithms and outperforming existing traditional techniques. 

The difference between waiting time and launch time makes it possible to compare which one is 
better about the literature, but it is difficult to conclude if the study is done with only one value, where MAE 
is related to each other without considering waiting time and launch time with a value of 44.18 W/m? which 
has similarities with other studies. After the solar irradiance is estimated, the PV is then converted and 
calculated analytically with minimized errors, which are 2.55-4.04% for the rMAE value and 3.45-5.95% for 
the rRMSE. The shape of the error matrix shows similar results to those presented above, so it can be 
concluded that the generalized and performed MAE, in this case, is 137.22 W facilitated by 2.97 kWp PV 
[53]. The prediction interval is chosen once it is possible to obtain an estimate of the available PV power 
within an acceptable range of point estimate values. The method considers the Laplacian error distribution 
and distinguishes between waiting time, launch time, and day type, which are selected by using the 
instructions of the k-NN as a CCF function. The confidence level can be maintained by verifying the 
calculated PICP and a value of 80% is obtained. These results show that there is a clear difference between 
the PICP and the confidence level on cloudy days before sunset, but for predictions at these hours, it is not 
very important, so it can be concluded that the chosen prediction interval is very relevant. 

The results show that the reduction of RMSE error is influenced by several important factors 
affecting accuracy and the strategy used [54]. PV generation systems are one of the easiest and most cost- 
effective renewable energy sources (RES) that can be utilized in households and it is possible to convert PV 
modules into customizable PLTS nodes [55]. Ultimately, indirect prediction predictions for PV system 
performance models are required to obtain solar power generation predictions. Strategies used to predict 
performance under irradiation and temperature conditions should be adopted [56]. At the specified 
operational location of PV panels, alternative methods can be obtained [57] and can improve accuracy, while 
other studies use a calculated whole-population model to simplify the process [58]. Finally, PV power 
estimates were made and prediction intervals were selected for PV module facilities. It can be concluded that 
the VPP environment and PV facilities at the weather station can be simulated with an accuracy of 12.36% 
against MAE and 11.85% against RMSE. 


4. CONCLUSION 

Based on the results of the discussion above, it is concluded that the forecasting model for solar 
power sources as a virtual solar generator uses statistical analysis and evaluation with MAE and RMSE. 
These two models are generally used to calculate the prediction error of a data series, with the advantages 
and disadvantages of each model, so that the modeling decision is determined by the type of data to be 
managed in the study. Power prediction research at PV generators and prediction intervals were chosen for 


Solar power forecasting model as a renewable generation source on virtual power plants (Suwarno) 


710 m) ISSN: 2302-9285 


PV module facilities where the VPP model environment and PV facilities at weather stations could be 
simulated with an accuracy of 12.36% (0.1236) against MAE and 11.85% (0.1185) against RMSE. For 
forecasts derived from MAE, the distribution given by dividing the predictions is a function of lead time, 
launch time, and type of day which will be very useful at the high level of accuracy required for prediction. 
The predictions obtained with forecasting errors in this study are much smaller at short waiting times. The 
comparison strategy described in this paper regarding estimation, namely with a short waiting time, will 
produce better estimates compared to traditional methods which have similarities with other studies. The 
error matrix form shows results similar to those presented above, so the MAE method is more commonly 
used in research cases. Modeling assumes that the change is linear with time with SF decreasing at sunset, 
and also varies by the season of the year to get rMAE values, for sunny days, partly cloudy. Observations 
based on RMSE give results on sunny, partly cloudy, and cloudy days. Apart from MAE and RMSE, the 
main GHI is also calculated to find the error values obtained to maintain the same performance as obtained in 
the previous section by using errors to build prediction intervals. Which metric is best depends on the use 
case and research data set. However, RMSE is usually the preferred metric over MAE for measuring model 
performance. This is because developers often want to reduce the appearance of large outliers in their 
predictions and the MAE is considered too simple to understand the overall model performance. 
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